a tiny vision language model that kicks ass and runs anywhere
Moondream is a highly efficient open-source vision language model that combines powerful image understanding capabilities with a remarkably small footprint. It's designed to be versatile and accessible, capable of running on a wide range of devices and platforms.
The project offers two model variants:
- Moondream 2B: The primary model with 2 billion parameters, offering robust performance for general-purpose image understanding tasks including captioning, visual question answering, and object detection.
- Moondream 0.5B: A compact 500 million parameter model specifically optimized as a distillation target for edge devices, enabling efficient deployment on resource-constrained hardware while maintaining impressive capabilities.
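As a rough sketch of what those tasks look like in practice, the snippet below loads the 2B model from Hugging Face and runs captioning, visual question answering, and object detection. The `vikhyatk/moondream2` model id is the published checkpoint; the `caption`, `query`, and `detect` methods follow the model card, but verify them (and pin a `revision`) against the release you install.

```python
# Minimal local-inference sketch. Method names follow the Hugging Face
# model card and should be checked against the revision you install.
from PIL import Image
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "vikhyatk/moondream2",
    trust_remote_code=True,  # Moondream ships its own modeling code
    # device_map={"": "cuda"},  # uncomment to run on a GPU
)

image = Image.open("photo.jpg")

# Captioning
print(model.caption(image, length="short")["caption"])

# Visual question answering
print(model.query(image, "How many people are in the image?")["answer"])

# Object detection
print(model.detect(image, "face")["objects"])
```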
Moondream can be run locally or in the cloud. Please refer to the Getting Started page for details.
- Modal - Modal lets you run jobs in the cloud by writing just a few lines of Python. Here's an example of how to run Moondream on Modal:
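The sketch below shows one way a Modal deployment could look, using Modal's `App`, `Image`, and `local_entrypoint` APIs with the Hugging Face checkpoint above. The GPU type, pip dependency list, and the `caption` function are illustrative assumptions, not a prescribed setup.

```python
# Hypothetical Modal deployment sketch: run with `modal run moondream_modal.py`.
# The GPU type and dependency list are assumptions; adjust as needed.
import modal

app = modal.App("moondream-demo")

image = modal.Image.debian_slim().pip_install(
    "transformers", "torch", "pillow", "einops", "accelerate"
)

@app.function(gpu="T4", image=image)
def caption(image_bytes: bytes) -> str:
    import io
    from PIL import Image
    from transformers import AutoModelForCausalLM

    # Load the model inside the remote container so weights are
    # downloaded where the function actually runs.
    model = AutoModelForCausalLM.from_pretrained(
        "vikhyatk/moondream2",
        trust_remote_code=True,
        device_map={"": "cuda"},
    )
    img = Image.open(io.BytesIO(image_bytes))
    return model.caption(img, length="short")["caption"]

@app.local_entrypoint()
def main(path: str = "photo.jpg"):
    # Read the image locally and ship the bytes to the cloud function.
    with open(path, "rb") as f:
        print(caption.remote(f.read()))
```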